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1 SEQUENCE LISTING 

2 £^ 

3 ( 1 ) General Information 
4 

5 (i) APPLICANT: Ullrich,, Axel 

6 Gishizsky, , Mikhail 

7 Sures, , Irman G . 
8 

9 (ii) TITLE OF THE INVENTION: NOVEL MEGAKARYOCYTE PROTEIN 

10 TYROSINE KINASES 
11 

12 (iii) NUMBER OF SEQUENCES: 21 
13 

14 (iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: Pennie & Edmonds 

16 (B) STREET: 1155 Avenue of the Americas 

17 (C) CITY: New York, 

18 (D) STATE: NY 

19 (E) COUNTRY: USA 

20 (F) ZIP: 10036-2711 
21 

22 (V) COMPUTER READABLE FORM: 

23 (A) MEDIUM TYPE: Diskette 

24 (B) COMPUTER: IBM Compatible 

25 (C) OPERATING SYSTEM: DOS 

26 (D) SOFTWARE: FastSEQ Version 2.0 
27 

28 (vi) CURRENT APPLICATION DATA: 

0 £ 29 (A) APPLICATION NUMBER: 08/426-509 

30 (B) FILING DATE: 21-APR-1995 

31 (C) CLASSIFICATION: 
32 

33 (vii) PRIOR APPLICATION DATA: 

34 (A) APPLICATION NUMBER: 08/232,545 

35 (B) FILING DATE: 
36 

37 
38 

3 9 (Viii) ATTORNEY/AGENT INFORMATION: 

40 (A) NAME: Coruzzi, Laura A m 

41 (B) REGISTRATION NUMBER: 30,742 

42 (C) REFERENCE/DOCKET NUMBER: 7683-00/74-999 
43 

44 (ix) TELECOMMUNICATION INFORMATION: 

45 (A) TELEPHONE: 212-790-9090 * 

46 (B) TELEFAX: 212-869-9741 
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RAW SEQUENCE LISTING DATE: 07/10/97 

PATENT APPLICATION US 708/426, 509 »A TIME: 14:19:55 

INPUT SET: S18865. raw 

(C) TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CTCGCTCCAA GTTGTGCAGC CGGGACCGCC TCGGGGTGTG CAGCCGGCTC GCGGAGGCCC 60 

TCCTGGGGGC GGGCGCGGGG CGGCTCGGGG GCGCCCCCTG AG C AG AAA AC AGGAAGAACC 120 

AGGCTCGGTC CAGTGGCACC CAGCTCCCTA CCTCCTGTGC CAGCCGCCTG GCCTGTGGCA 180 

GGCCATTCCC AGCGTCCCCG ACTGTGACCA CTTGCTCAGT GTGCCTCTCA CCTGCCTCAG 240 

TTTCCCTCTG GGGGGCGATG GCGGGGCGAG GCTCTCTGGT TTCCTGGCGG GCATTTCACG 300 

GCTGTGATTC TGCTGAGGAA CTTCCCCGGG TGAGCCCCCG CTTCCTCCGA GCCTGGCACC 360 

CCCCTCCCGT CTCAGCCAGG ATGCCAACGA GGCGCTGGGC CCCGGGCACC CAGTGTATCA 420 

CCAAATGCGA GCACACCCGC CCCAAGCCAG GGGAGCTGGC CTTCCGCAAG GGCGACGTGG 480 

TCACCATCCT GGAGGCCTGC GAGAACAAGA GCTGGTACCG CGTCAAGCAC CACACCAGTG 540 

GACAGGAGGG GCTGCTGGCA GCTGGGGCGC TGCGGGAGCG GGAGGCCCTC TCCGCAGACC 600 

CCAAGCTCAG CCTCATGCCG TGGTTCCACG GGAAGATCTC GGGCCAGGAG GCTGTCCAGC 660 

AGCTGCAGCC TCCCGAGGAT GGGCTGTTCC TGGTGCGGGA GTCCGCGCGC CACCCCGGCG 720 

ACTACGTCCT GTGCGTGAGC TTTGGCCGCG ACGTCATCCA CTACCGCGTG CTGCACCGCG 780 

ACGGCCACCT CACAATCGAT GAGGCCGTGT TCTTCTGCAA CCTCATGGAC ATGGTGGAGC 840 

ATTACAGCAA GGACAAGGGC GCTATCTGCA CCAAGCTGGT GAGACCAAAG CGGAAACACG 900 

GGACCAAGTC GGCCGAGGAG GAGCTGGCCA GGGCGGGCTG GTTACTGAAC CTGCAGCATT 960 

TGACATTGGG AGCACAGATC GGAGAGGGAG AGTTTGGAGC TGTCCTGCAG GGTGAGTACC 1020 

TGGGGCAAAA GGTGGCCGTG AAGAATATCA AGTGTGATGT GACAGCCCAG GCCTTCCTGG 1080 

ACGAGACGGC CGTCATGACG AAGATGCAAC ACGAGAACCT GGTGCGTCTC CTGGGCGTGA 1140 

TCCTGCACCA GGGGCTGTAC ATTGTCATGG AGCACGTGAG CAAGGGCAAC CTGGTGAACT 1200 

TTCTGCGGAC CCGGGGTCGA GCCCTCGTGA ACACCGCTCA GCTCCTGCAG TTTTCTCTGC 1260 

ACGTGGCCGA GGGCATGGAG TACCTGGAGA GCAAGAAGCT TGTGCACCGC GACCTGGCCG 1320 

CCCGCAACAT CCTGGTCTCA GAGGACCTGG TGGCCAAGGT CAGCGACTTT GGCCTGGCCA 1380 

AAGCCGAGCG GAAGGGGCTA GACTCAAGCC GGCTGCCCGT CAAGTGGACG GCGCCCGAGG 1440 

CTCTCAAACA CGGGAAGTTC ACCAGCAAGT CGGATGTCTG GAGTTTTGGG GTGCTGCTCT 1500 

GGGAGGTCTT CTCATATGGA CGGGCTCCGT ACCCTAAAAT GTCACTGAAA GAGGTGTCGG 1560 

AGGCCGTGGA GAAGGGGTAC CGCATGGAAC CCCCCGAGGG CTGTCCAGGC CCCGTGCACG 1620 

TCCTCATGAG CAGCTGCTGG GAGGCAGAGC CCGCCCGCCG GCCACCCTTC CGCAAACTGG 1680 

CCGAGAAGCT GGCCCGGGAG CTACGCAGTG CAGGTGCCCC AGCCTCCGTC TCAGGGCAGG 1740 

ACGCCGACGG CTCCACCTCG CCCCGAAGCC AGGAGCCCTG ACCCCACCCG GTGGGGCCCT 1800 

TGGCCCCAGA GGACCGAGAG AGTGGAGAGT GCGGCGTGGG GGCACTGACC AGGCCCAAGG 1860 

AGGGTCCAGG CGGGCAAGTC ATCCTCCTGG TGCCCACAGC AGGGGCTGGC CCACGTAGGG 1920 

GGCTCTGGGC GGCCCGTGGA CACCCCAGAC CTGCGAAGGA TGATCGCCCG ATAAAGACGG 1980 

ATTCTAAGGA CTCTAAAAAA 2000 

A 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 507 amino acids 
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PATENT APPLICATION US/08/426,509A TIME: 14:19:57 

INPUT SET: S18865.mw 
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(ii) MOLECULE 


TYPE : None 
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RAW SEQUENCE LISTING DATE: 07/ 1 0/97 

PATENT APPLICATION US/08/426,509A TIME: 14:19:59 

INPUT SET: S18865. raw 
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(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCGCTTTTTG CTTAGAGCTT GAGAGTCAAA GTTAAGGACC CACATGTATA CTTCGGCTCT 60 

AGCGAGTCTA AGGATGATAA TATGGATACA AAATCTATTC TAGAAGAACT TCTTCTCAAA 120 

AGATCACAGC AAAAGAAGAA AATGTCACCA AATAATTACA AAGAACGGCT TTTTGTTTTG 180 

ACCAAAACAA ACCTTTCCTA CTATGAATAT GACAAAATGA AAAGGGGCAG CAGAAAAGGA 240 

TCCATTGAAA TTAAGAAAAT CAGATGTGTG GAGAAAGTAA ATCTCGAGGA GCAGACGCCT 300 

GTAGAGAGAC AGTACCCATT TCAGATTGTC TATAAAGATG GGCTTCTCTA TGTCTATGCA 360 

TCAAATGAAG AGAGCCGAAG TCAGTGGTTG AAAGCATTAC AAAAAGAGAT AAGGGGTAAC 420 

CCCCACCTGC TGGTCAAGTA CCATAGTGGG TTCTTCGTGG ACGGGAAGTT CCTGTGTTGC 480 

CAGCAGAGCT GTAAAGCAGC CCCAGGATGT ACCCTCTGGG AAGCATATGC TAATCTGCAT 540 

ACTGCAGTCA ATGAAGAGAA ACACAGAGTT CCCACCTTCC CAGACAGAGT GCTGAAGATA 600 

CCTCGGGCAG TTCCTGTTCT CAAAATGGAT GCACCATCTT CAAGTACCAC. TCTAGCCCAA 660 

TATGACAACG AATCAAAGAA AAACTATGGC TCCCAGCCAC CATCTTCAAG TACCAGTCTA 720 

GCGCAATATG ACAGCAACTC AAAGAAAATC TATGGCTCCC AGCCAAACTT CAACATGCAG 780 

TATATTCCAA GGGAAGACTT CCCTGACTGG TGGCAAGTAA GAAAACTGAA AAGTAGCAGC 840 

AGCAGTGAAG ATGTTGCAAG CAGTAACCAA AAAGAAAGAA ATGTGAATCA CACCACCTCA 900 

AAGATTTCAT GGGAATTCCC TGAGTCAAGT TCATCTGAAG AAGAGGAAAA CCTGGATGAT 960 

TATGACTGGT TTGCTGGTAA CATCTCCAGA TCACAATCTG AACAGTTACT CAGACAAAAG 1020 

GGAAAAGAAG GAGCATTTAT GGTTAGAAAT TCGAGCCAAG TGGGAATGTA CAGSAGTGTCC 1080 

TTATTTAGTA AGGCTGTGAA TGATAAAAAA GGAACTGTCA AACATTACCA CGTGCATACA 1140 

AATGCTGAGA ACAAATTATA CCTGGCAGAA AACTACTGTT TTGATTCCAT TCfJAAAGCTT 1200 

ATTCATTATC ATCAACACAA TTCAGCAGGC ATGATCACAC GGCTCCGCCA CCCTGTGTCA 1260 

ACAAAGGCCA ACAAGGTCCC CGACTCTGTG TCCCTGGGAA ATGGAATCTG GGAACTGAAA 1320 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/426, 509 A 



DATE: 07/10/97 
TIME: 14:20:02 



INPUT SET: SI 8865. raw 
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AGAGAAGAGA 
GGCAAGTGGA 
GAAGATGAAT 
AAATTCTATG 
AATGGCTGCT 
TTAGAAATGT 
CACCGGGACT 
GACTTTGGAA 
TTTCCAGTCA 
GACGTATGGG 
GACTTGTATG 
CCCCACCTGG 
GAAAAGCGTC 
AAGCATTGAA 
TTCATTCATT 
TCTCTGTATT 
TTTAGGTCAA 
GCAGAAGCAC 
AACTGAAAAA 
TTAAATATAC 



TTACCTTGTT 
AGGGGCAGTA 
TCTTTCAGGA 
GAGTGTGTTC 
TGCTGAATTA 
GCTACGATGT 
TGGCTGCTCG 
TGACAAGGTA 
AGTGGTCAGC 
CATTTGGGAT 
ACAACTCCCA 
CATCGGACAC 
CCACATTTCA 
GAAGAAATTA 
TTAAGGAAAG 
GTCTATTATT 
ATTAGTAATT 
ATTTTCAGAC 
TTACTTATTG 
TACCAAGTAC 



GAAGGAGCTG 
TGATGTTGCT 
GGCCCAGACT 
AAAGGAATAC 
CCTGAGGAGT 
CTGTGAAGGC 
TAACTGCTTG 
TGTTCTTGAT 
TCCAGAGGTG 
CCTGATGTGG 
GGTGGTTCTG 
CATCTACCAG 
GCAACTCCTG 
GGAGTGCTGA 
TAGCAAGGCA 
TAGAAATGAA 
TTGTTTATGC 
TGCAATATAG 
GATATTCATT 
AAAAAAAAAA 



GGAAGTGGCC 
GTTAAGATGA 
ATGATGAAAC 
CCCATATACA 
CACGGAAAAG 
ATGGCCTTCT 
GTGGACAGAG 
GACCAGTATG 
TTTCATTACT 
GAGGTGTTCA 
AAGGTCTCCC 
ATCATGTACA 
TCTTCCATTG 
TAAGAATGAA 
TAATGTAATT 
CAAGGCAGGA 
TGCCCCTGAT 
AGACTGTGTT 
CTTTTCTTTA 
AAAAAAAAAA 



AGTTTGGAGT 
TCAAGGAGGG 
TCAGCCATCC 
TAGTGACTGA 
GACTTGAACC 
TGGAGAGTCA 
ATCTCTGTGT 
TCAGTTCAGT 
TCAAATACAG 
GCCTGGGGAA 
AGGGCCACAG 
GCTGCTGGCA 
AACCACTTCG 
TATAGATGCT 
TAGCTAGTTT 
AACAAAAGAT 
ATAACACTTT 
CATGTGTAAA 
TATTGTCATT 



GGTCCAGCTG 
CTCCATGTCA 
CAAGCTGGTT 
ATATATAAGC 
TTCCCAGCTC 
CCAATTCATA 
GAAAGTATCT 
CGGAACAAAG 
CAGCAAGTCA 
GCAGCCCTAT 
GCTTTACCGG 
CGAGCTTCCA 
GGAAAAAGAC 
GGCCAGCATT 
TTAATAGTGT 
TCCCTTGAAA 
CCAGCCTATA 
GACTGAGCAG 
GTCACAACAA 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



239 


Met 


Asp 


Thr 


Lys 


Ser 


He 


Leu 


Glu 


Glu 


Leu 


Leu 


Leu 


Lys 


Arg 


Ser 


Gin 


240 


1 








5 










10 










15 




241 


Gin 


Lys 


Lys 


Lys 


Met 


Ser 


Pro 


Asn 


Asn 


Tyr 


Lys 


Glu 


Arg 


Leu 


Phe 


Val 


242 








20 










25 










30 






243 


Leu 


Thr 


Lys 


Thr 


Asn 


Leu 


Ser 


Tyr 


Tyr 


Glu 


Tyr 


Asp 


Lys 


Met 


Lys 


Arg 


244 






35 










40 










45 








245 


Gly 


Ser 


Arg 


Lys 


Gly 


Ser 


He 


Glu 


He 


Lys 


Lys 


He 


Arg 


Cys 


Val 


Glu 


246 




50 










55 










60 










247 


Lys 


Val 


Asn 


Leu 


Glu 


Glu 


Gin 


Thr 


Pro 


Val 


Glu 


Arg 


Gin 


Tyr 


Pro 


Phe 


248 


65 










70 










75 










80 


249 


Gin 


He 


Val 


Tyr 


Lys 


Asp 


Gly 


Leu 


Leu 


Tyr 


Val 


Tyr 


Ala 


Ser 


Asn 


Glu 


250 










85 










90 










95 




251 


Glu 


Ser 


Arg 


Ser 


Gin 


Trp 


Leu 


Lys 


Ala 


Leu 


Gin 


Lys 


Glu 


He 


Arg 


Gly 


252 








100 










105 










11* 






253 


Asn 


Pro 


His 


Leu 


Leu 


Val 


Lys 


Tyr 


His 


Ser 


Gly 


Phe 


Phe 


Val 


Asp 


Gly 


254 






115 










120 










1&5 








255 


Lys 


Phe 


Leu 


Cys 


Cys 


Gin 


Gin 


Ser 


Cys 


Lys 


Ala 


Ala 


Pro 


Gly Cys 


Thr 


256 




130 










135 










140 


• 








257 


Leu 


Trp 


Glu 


Ala 


Tyr 


Ala 


Asn 


Leu 


His 


Thr 


Ala 


Val 


Asn 


Glu 


Glu 


Lys 


258 


145 










150 










155 










160 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2500 



« 



PAGE: 1 



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/08/426,509A 



DATE: 07/10/97 
TIME: 14:20:04 



INPUT SET: S18865.mw 



Line 
29 



Error 

Wrong application Serial Number 



Original Text 

(A) APPLICATION NUMBER: 08/426-509 



